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Abstract. We generalize various notions of stability of invariant sets of dy- 
namical systems to invariant measures, by defining a topology on the set of 
measures. The defined topology is similar, but not topologically equivalent to 
weak* topology, and it also differs from topologies induced by the Ricsz Rep- 
resentation Theorem. It turns out that the constructed topology is a solution 
of a limit case of a p-optimal transport problem, for p = oo. 



1. Introduction 

The motivation for this paper is two-fold. The first motivation is related to 
dynamical systems, and to finding the "right" topology on the set of measures, 
namely a topology which replicates certain properties of a dynamical system on a 
metric space to the induced dynamical system on the set of measures. The second 
motivation is to investigate an alternative formulation of the well-known Optimal 
transport problem. 

The "right" topology on the set of measures. Let / be a continuous 
function (i.e. a discrete dynamical system) on a metrizable topological space X, 
and /jj be the induced function on the set of Borel probability measures P(X) on X. 
Then /j is a discrete dynamical system on P(X). Study of the dynamical system 
/jt can often give useful information on the system /, as was shown for example in 
0, |12j . Now, a number of fundamental properties of a dynamical system / are not 
analogous to similar properties of the dynamical system /j , in any of the standard 
topologies, such as the weak* topology on P(X). We give two examples. 

Let x £ X be a sink or a source of /, and let 8 X be the probability measure 
supported on {x}. Then, typically, S x is not a sink or a source of /j in the weak* 
topology. A detailed discussion of this is in Section 5. 

Now, let A C X be a closed invariant set of /, and in addition an attracting set. 
Let A$ be the set of all measures in P(X) such that its support is a subset of A. 
One can see (and we show in Section 5) that A$ is an invariant set of /j, but is 
typically not an attractor with respect to the weak* topology. 

The same conclusions as in the last two examples hold for flows and semiflows. 
They also hold for other usual topologies on the set of measures, induced by topolo- 
gies on the set C*(X) of all bounded linear functionals on the set of continuous real 
valued functions C(X), and the Riesz Representation Theorem. 

Our goal is to find the "right" topology on the set of measures which would 
naturally generalize various notions of stability and attraction to dynamical systems 
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on the set of measures. The topology should be close enough to the weak* topology, 
so that one can find nontrivial compact sets of measures, and use it in various 
applications. Such a topology constructed here is called dynamical topology. We will 
also show that, by identifying stable or attracting sets of measures (with respect to 
the dynamical topology) rather than sets of points, we get better insight and more 
information on behavior of a chosen dynamical system. 

The oo-optimal transport problem. Let X be a metric space with a metric 
d, and fj,, v be two Borel probability measures on X. Then the p-Wasserstein 
distance W p , where 1 < p < oo, is the function 

(1.1) C£( 7 )= / d(x,ymx,y), 

J XxX 

(1.2) W p (p, v) = inf{C p ( 7 ) | 7 6 TQi, u)}. 

The set T(n,is) is the set of all transports; i.e. the set of all Borel probability 
measures 7 on X x X, such that iritf) = M, 7T2j7 = v, where tti, 7T2 are projec- 
tions of X x X to the first, resp. second variable (transports are sometimes called 
"couplings" in the probability and some ergodic theory literature). 

The measure 7 which minimizes (|1.2p is a solution of the optimal transport 
problem in the p-norm. Intuitively, the minimizer 7 is the measure which describes 
how the points in the support of fi are coupled to the points in the support of v, 
so that the p-norm of the coupling distances is minimal. 

There is rich literature on the optimal transport problem, including proofs of 
existence, uniqueness, and properties for various spaces X and norms 1 < p < 00 
(see e.g. [I], [2], [4], [11]). (The metric d(x,y) can also be replaced by a more 
general cost function c(x,y).) 

In this paper we study the case p = 00. This case may have implications to 
various optimization problems where the cost of transport does not depend on 
mass to be transported, but only on the maximal transport distance. 

For all 1 < p < 00 the formula (|1.2|) generates a metric on the space of the 
probability measures P(X), called p-Wasserstein metric. One can show that p- 
Wasserstein metrics are for 1 < p < +00 uniformly equivalent to each other and 
to the Prokhorov metric, and so generate the weak* topology on P(X) (see e.g. 
[Sj). We will show that they are neither uniformly nor topologically equivalent to 
the metric and topology in the case p = +00, and that the topology in the case 
p = +00 is the dynamical topology defined in the first part of the paper. 

The structure of the paper. We start with definitions of dynamical metric 
and dynamical topology, which are the main tools of this paper. Then we show 
that the dynamical topology indeed differs from the weak* topology, and from 
topologies induced by the Riesz Representation Theorem. In the third section, we 
discuss various properties of the dynamical topology. We focus on characterization 
of convergence of the set of measures with respect to the dynamical topology in 
the fourth section. The proof of characterization of convergence is combinatorial in 
character (as such, it could have implications in the combinatorial ergodic theory). 
We continue with proofs that the dynamical topology indeed gives natural gener- 
alizations of various notions of stability to dynamical systems on sets of measures. 
Finally we analyze the oo-optimal transport problem, and we show that the 00- 
Wasserstein metric generated by the oo-optimal transport problem generates the 
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dynamical topology on the P(X). We also prove existence of a solution of the 
oo-optimal transport problem. 

2. Definition of the dynamical topology 

In this paper X is always a compact metric space, equipped with a Borel o- 
algebra into a measurable space, and d is its metric. Let M(X) be the space of 
all finite Borel measures on X, and P(X) C M(X) the space of all probability 
measures. Let C(X) be the normed (Banach) space of all real valued continuous 
functions / : X — » R. Then M(X) can be naturally embedded in the dual space of 
all bounded linear functionals on C(X). 

We denote by t w the weak* topology M(X), and by t u the uniform topology 
(i.e. the topology induced by the sup-norm on the dual C*(X) of C(X)). The 
topology t u is much finer than the topology t w , and as such is seldom used in the 
dynamical systems. We will show that both topologies differ from the topology to 
be constructed. We use the notation "w-", "u-", and "d-" ("d" for the dynamical 
topology, yet to be defined) when referring to properties of a set or a sequence 
in various topologies. In particular, w-convergent, u-convergent, and d-convergent 
means that a sequence of measures in P(X) is convergent with respect to weak*, 
uniform, or dynamical topology respectively. 

We now define dynamical metric and topology on the set P(X). Let I be the 
unit interval [0,1], and A the Lebesgue measure defined on the family of Borel- 
measurable subsets of I. Given two functions /, g : I — > X, we define their distance 
as 

D(f,g) = sup d(f(a), g(a)). 

ael 

Distance D is well defined because of compactness of X. It is straightforward to 
check that D is symmetric, and that it satisfies the triangle inequality D(f,g) + 
D(g,h)>D(f,h). 

If / : I — ► X is a (Borel) measurable function, then /(A = fi denotes the measure 
H(A) = A(/ _1 (A)) for all measurable AC X. 

Definition 1. We define the distance between two probability measures fi,v as 
A(/x, v) = inf D(f,g), where infimum goes over all measurable junctions f, g : I — > 
X , satisfying /i — /jA, v = g$\. 

We will now prove in several steps that P(X) equipped with A is indeed a metric 
space. We will use the following form (as in e.g. |3], Proposition 2.17) of the well 
known isomorphism Theorem ([7j, Theorem C of Section 41.): 

Theorem 1. If X is a compact metric space with a nonatomic Borel probability 
measure fi, then it is isomorphic (in the category of measure spaces) to the the 
Lebesgue measure A on the family of Borel measurable subsets of I. 

Corollary 1. Suppose fi is a Borel probability measure on a compact space X. 
Then there exists a Borel measurable function f : I — > X such that fi = /jA. 

This implies that the infimum in the definition of A goes over a nonempty set, 
hence A is well defined. 

In the rest of the paper, "measurable" will always mean " Borel-measurable" . 
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Lemma 1. Suppose fi £ P(X), and let 31,32 : I —> X be measurable functions, 
such that /i = <?ijA = g 2 $\. Then for each e > 0, there exist measurable, X-invariant 
functions h±,h 2 : I — > 1 such that 

(2.1) D(g 1 ah 1 ,g 2 oh 2 )<e. 

Proof. Let Ci,C 2 , Ck be any pairwise disjoint, measurable cover of the support 
of fi, such that each C$ has diameter less than e (such a cover exists because 
of compactness of X). Without loss of generality we also assume that for all i, 
niCi) > 0. We define a set Y e C J x I, and a Borel probability measure v e on Y e , 
with 

Y e ,i = 9i X {Ci) x g^iQ), Y e = Uti y e.<. 

where A 2 is the Lebesgue measure on 7x7. Since \ 2 {Y e ^) — X(g^ 1 (Ci))-X(g 2 ~ 1 (Ci)) = 
fj,(Ci) 2 , one can easily check that v e is a probability measure. By definition, for any 

a e Y e , 

(2-2) \gi(7ri(a))~g 2 (n 2 (a))\<e, 

where iri,ir 2 : I 2 — > I are coordinate projections. For any measurable A C I, 

v e {AxI) = EtiA(^n.gr 1 (a))-A(< ?2 - 1 (C l ))/M(C i ) = ZUKAng^iQ)) = A (A), 
and similarly v e {l x A) = X(A), hence 

(2.3) TTl^e = it 2 $v e = X. 

By using Corollary [TJ we find a measurable function h* : I — > I 2 such that 
f e = fc|A, and v e (I) C F e . Now ([272]) and |2~3)) imply that hi = -Kioh* , h 2 = n 2 oh* 
are the required functions. □ 

Proposition 1. TTie function A is a metric on P(X). 

Proof. Since D is symmetric, so is A. The claim A(/i, fi) = is trivial. 

Now suppose that A(/it, ^) = for /.t ^ v. Choose any h € C(X), and e > 0. 
Because of compactness of X, /i is uniformly continuous, and there exists S > such 
that d(x, y) < 6 implies \h(x) — h(y)\ < e for all x,y £ X. Now we find f,g:I—>X 
such that /1 = /jjA, 1/ = g A, and D(f,g) < 5. Now 



hdfi — / /icif 
x Jx 



h(f(t))dt - / %(t))dt 
/ J 1 



< lHf(t))-h{g{t))\dt <e. 



Since e and h were arbitrary, we see that fj,, v are identical linear functionals on 
C(X). Now the Riesz Representation Theorem implies that /i = v. 

Finally, we prove the triangle inequality. Let r\,[i,v £ P(X). Choose arbitrary 
e > 0, and assume that fi, gi, g 2 , f 2 ■ I — > X are measurable functions such that 
V = ZijA, \i = 3ijA = g 2(t X, v = / 2t jA, and such that 

(2.4) Afa./x) >£>(/i, fl i)-e, A(/x, i/) > D(g 2 , f 2 ) - e. 

Now we find hi, h 2 as in Lemma[T] Note now that for arbitrary functions f*,g* ■ 
I X, h* : I I, D(f*,g*) > D{f* o h*,g* o ft*), so first applying that, then 
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(|2.ip . and finally the triangle inequality for D we get: 

£>(/i,3i) + D(g 2 , h) > D{fx ° h u gi o fa) + D( ff2 o fa, / 2 o fa) > 

> D(fi o hi,g\ o fa) + (D(5i o fa, 52 o fa) - e)+ 
+ D(g 2 o fa, h o fa) 

(2.5) >D(f 1 oh 1 J 2 oh 2 )-e. 

Since fa, h 2 are A-invariant, (/i o fa)jA = 77, (/ 2 ° fa)(|A = v, hence 

(2.6) 0(/ioAi,/joft3)> Afai/) 

Combining ([2~4]) . ((23| and (j2T6j) we conclude that A(?7, /i) + A(/i, 1/) > A(?7, v) - 
3e. Since e was arbitrary, A satisfies the triangle inequality. □ 

Definition 2. The topology on P(X) induced by the metric A is called dynamical 
topology. We denote it by Td- 

3. Properties of the dynamical topology 

We now compare different topologies on P{X), and investigate elementary prop- 
erties of the dynamical topology. 

Proposition 2. The weak* topology on P(X) is coarser than Td- Equivalently, 
d-convergence implies w- convergence. 

Proof. Assume that a sequence \i n £ P{X) d-converges to a /i £ P{X). Choose an 
arbitrary / £ C(X), and e > 0. Since X is compact, / is uniformly continuous, 
and there is S > such that \x — y\ < S implies \f{x) — f{y)\ < e. Now choose 
no large enough such that A(/i„, jj) < 5/2 for n > hq, and for given n choose g n , g 
such that (U„ = g n $\ A* = 50-^; an d D(g n ,g) < S. Now 



l^n(J) ~ = 



fdun - / fdfi 
x J x 



if 9n - f g)d\ 



< 



< j\f°g n -f°g\d\< j sd\<e, 

therefore fj, n w-converges to /j. □ 

The next simple example shows that the dynamical topology differs from both 
the weak* and uniform topology on any nontrivial X (i.e. X with more than one 
element). We will see that the dynamical topology refines the weak* topology in a 
very different way than the uniform topology. 

Example 1. Suppose that fx n is a sequence of atomic measures, each supported on 
k points 2" £ X, i = 1, k, n £ N . We write 

i=X,...,fc 

where x™ £ X, pf > 0, and Yli=iPi = 1- Suppose now that fi n w-converges to 
/J = X)j=i kli^iVi)- Without loss of generality we can assume that p™ — » and 
xf Hi- It is easy to check that fx n is u-convergent if and only if for all i, x™ is 
eventually constant (i.e. there exists n$ such that for all n > no, x" — yi). 

On the other hand, one can check that the sequence (fi n ) is d- convergent, if and 
only if for all i, p™ is eventually constant. 
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We also deduce that in this example is at the same time u- and d-convergent, 
if and only if it is eventually constant. 

Proposition [2] and the Example above imply the following conclusion. 

Corollary 2. (i) If X has at least two elements, then t w C t^, but not equal to it; 
(ii) If X is not a finite set, then (£. r u and t u <f_ t^; 

Now we discuss w-connectedness and w-compactness of P(X). 

Proposition 3. (i) If X is path- connected, then P(X) is d-path connected. 

(ii) If X has at least two elements, than P(X) with the dynamical topology is 
not d- sequentially compact, and not d-compact. 

Proof, (i) Suppose that X is path-connected. Let fi, \i! be any two measures in 
P(X), and choose any measurable f,f':I—>X such that \x = /jj/, // = f'^I. Now, 
since X is path connected, there is a measurable function g : I x I — > X, such that 
g(.,0) — /, g(., 1) = /', and such that t i— > g(a,t) is continuous for every a. Now 
the function 1 1— > g(.,t)$\ is a d-continuous curve in P(X), connecting \i and fj,'. 

(ii) We construct the following simple example: choose two points x y in X, 
and the sequence of measures 

1 n — 1 

(3.1) n n = -5 X H 5 V , 

n n 

where S x , S y are atomic measures concentrated in x, y. Now /i„ w-converges to 
H = 5 y . Since A(/i„, /i) = d(x, y), neither /x n nor any subsequence of /x n d-converge 
to \i. Proposition^ implies that /i„ has no convergent subsequence, hence P(X) is 
not d-sequentially compact. Since P(X) is metrizable, P(X) is not d-compact. □ 

We now develop several simple tools used in proofs later in the paper. For a given 
set J C. I and measurable /, g : I — > X, we define Dj(f,g) = sup ae j d(f(a),g(a)) 
We denote the support of a measure /i by supp(/i) . 

Lemma 2. Suppose J C. I is a measurable set of full measure. 

(i) For any measurable f,g:I—tX, there exist measurable f,g:I—* X, such 
that /jA = /jA, SjA = g t X, and D(f,g) < Dj(f,g). 

(ii) The distance A(/i, v) = inf Dj(f, g), where infimum goes over all measurable 
functions f,g : I — > X, satisfying fi — /jA, v = g$\. 

Proof, (i) Choose any x G X, and define f(a) = f{a), g(a) = g{a) for a E J, 
/(a) = <?(a) = x for a £ J. (ii) It follows from D(f, g) < Dj(f, g) < D(f, g). □ 

Lemma 3. If f,g : I — > X are measurable functions such that /i = /jA, ^ = ^jA 
/or given fi,i> e P(X), then there exist measurable functions f,]):I—>X, such that 
/i = /jjA, v = ~g$\, f{I) Csupp(/i), g{T) ^supp{v), and such that D(f,g) < D{f,g). 

Proof. Let J = / _1 (supp(/i)) n g _1 (supp(^)), and choose any to £ J. Now we 
define /(t) = f(t), g(t) = g(t) for t e J; /(t) = /(i ), flf(t) = ff(t ) for t g J. □ 

Among various (uniformly equivalent but not necessarily equivalent) definitions 
of the Hausdorff metric dn in the literature, we use the following as the most 
convenient here: if A, B are two closed subsets of X, then d{x, A) = in{{d(x, y), y € 
A}, d{A, B) = sup{rf(x, B)\xe A}, and d H (A, B) = max{,i(A, B),d(B, A)}. 
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Proposition 4. (%) dH{supp(p),supp{v)) < A(fj,,is); 

(ii) If p n d-converges to [i, then supp(p n ) converges to supp(n) in the Hausdorff 
topology. 

Proof, (i) Suppose fi, v £ P(X). For any e > there exist measurable functions 
f,g : I —t X, such that /i = /jA, v = g$A, and such that D(f,g) < A(/i, i/) + 
e. Using Lemma 02 we construct /, g, as in the Lemma. We easily check that 
df/(supp(^),supp(f)) < D(f,g) < D(f,g) < A(fj,,v) +e. Since e was arbitrary, (i) 
is proved, (ii) follows directly from (i). □ 

The claim (ii) of the Proposition|4]is not true in the weak* topology. The counter- 
example which is constructed in (|3.ip is a sequence of w-convergent measures //„, 
converging to a measure fi, but such that supports of measures \x n do not converge 
to the support of the measure fi. 

Assume that a sequence of measures p n w-converges to a measure p, and that 
the sequence of supports of measures p n converges in the Hausdorff topology to 
the support of p.. We can not then in general claim that p, n d-converges to /i. A 
counter-example is the sequence 

L n-l 

—S x H 

2n 2n 

for 6 X , 6 y as in (|3.1[) . The same example shows that we can find measures /z, v, 
such that dfr(supp(/i),supp(^)) = 0, but such that A(/i, v) is arbitrarily large. 

We now show that the dynamical topology is not much finer than the weak* 
topology. 

Proposition 5. The set of all measures in P(X) which are supported on a finite 
set is d-dense. 

Proof. Choose /i £ P{X), and any e > 0. Since X is compact, we can find a mea- 
surable pairwise disjoint cover C\, C 2 , C'k of A, such that the diameter of each C'i 
is at most e. For each 1 < i < k, choose any Xi £ Ci. Let v e = 2i=i MCi)^ , an d 
then v e is supported on the finite set {x±, Xk}. Choose any measurable / : I — > X 
such that p = /jA, and define g : I — > A with g(t) = xi for all t e / _1 (Cj). 
Now, v e = g$\, and since diameter of each Ci is at most e, D{f,g) < e, hence 
A(ji,v e )<e. □ 

With regards to the weak* topology, the set of all measures uniformly supported 
on a finite (multi)set is w-dense. No similar claim is true in the uniform topology. 

4. Characterization of convergence in the dynamical topology 

We first recall some well known properties of convergent sequences of measures. 

Proposition 6. Assume that [i n w-converges, d-converges or u-converges to /i. 
Then 

(i) For each open U C A, liminf fi n (U) > p(U). 

(ii) For each closed V C X, limsup/i„(V) < n(V). 

(Hi) For each W C A such that fJ,(d(W)) = ; then \\mp n (W) = p(W). 

Proof. The proof for w-convergence is e.g. in |13j . The rest follows from Proposition 
[2]and the definition of u-convergence. □ 

The following notion is the main tool for characterization of d-convergence. 
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Definition 3. We say that a measurable set A C X is p.- separating, if there exists 
an open set D, Cl(A) C D, such that fi(D \ A) = 0. 

Note that a /z-separating A can have measure 0. 

fn the following, e-neighborhood of a set A is the open set {x G X, such that 
3y G A, d(x, y) < e}. Note also that, since X is compact, if D is any open set such 
that Cl(A) C D, then for small e, an e-neighborhood of A is a subset of D. 

The proof of the following Lemma is an easy exercise. 

Lemma 4. Assume that A d X is a measurable set. A is fi- separating, if and only 
if for each small enough e > 0, if D is an e-neighborhood of A, then /j,(D \ A) = 0. 

Now we can characterize d- convergence. 

Theorem 2. A sequence of measures p, n d-converges to a measure fi if and only if 
the following two conditions hold: 

(i) The sequence (i n w-converges to the measure p; and 

(ii) For each p- separating set A, there is a neighborhood B of A, such that for 
any open C, CI (A) CCCJJ, there is uq such that for all n > no, fJ, n (C) = /i(C). 

We prove Theorem [2] in several steps. 

Lemma 5. Suppose fx n , [i satisfy conditions (i), (ii) of Theorem^ and choose 
any e > 0. Assume that A±, A2, A m is a cover of X, of measurable, nonempty, 
pairwise disjoint sets with diameter less or equal than e. Then we can find 5, 
< 5 < e, and an integer no, such that, if B\, B m are S -neighborhoods of 
Ax,..., A m respectively, then for all n > no, and any 1 < i\ < 12 < ... < ik < fn, 

(4.1) ii n {B lx UB i2 U...UB ik ) > t i(A n UA !2 U...UAJ. 

Proof. Let 2 be the set of all subsets of indices {ix, 12, ■■■> ik} f= {1? 2, m}, for 
which Ai x U Ai 2 U ... U Ai k is /i-separating. Now choose S < e small enough, such 
that the condition (ii) of the Theorem is satisfied in the following sense: for all 
(ii,«2) ■■■,ik) £ T, if Bx,..., B m are ^-neighborhoods of Ax,..., A m respectively, then 
there is n\ such that for all n > Tlx, 

(4.2) ^{B lx U B i2 U ... U B ih ) = /x„(Ai U B i% U ... U B ih ). 

For all (ii,*2) ■■■,ik) G since (A^ U Ai 2 U ... U Ai k ) is /i-separating, we can also 
assume that d is small enough, so that 

(4.3) ix{B h U B l2 U ... UB,J = U A l2 U ... U A ih ). 
The converse of Lemma |4] implies that if («i, Z2, u-) ^ X, 

(4.4) n(B h U B i2 U ... U B ik ) > n(A h U A l2 U ... U A lk ). 
Now, we define 

(4.5) p = min{fi(B n UB !2 U...U B ik ) - [i{A n U A l2 U ... U A ik ), (i x , i a , i k ) £ 1}. 

Then because of (|4.4p . p > 0. Now, choose ri2 such that, for all n>n 2 , and for all 
—,ik) $ 1 

(4.6) n n (B h U B i2 U ... U B ik ) > n(B h U B i2 U ... U B ik ) - p. 

Such ri2 exists because U-Bi 2 L)...L)Bi k is open, because /i n w-converges to \x, and 
because of Proposition O (i). Let no = max{ni,ri2}. Now, (|4.2p and (|4.3p imply 
(|4~T|) for (ix.ia, G T, and g3J) and gU) imply JO) for (n,i 2 , -,ik) ii- □ 
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Lemma 6. Let £ be a measure on X (positive, not necessarily a normed one), 
xi, X2, ■ x m , nonnegative real numbers, and B\, B2, ■ B m measurable subsets of 
X , such that for any 1 < i\ < i% < ... < ik < m, 



(4.7) £(-B n UB i2 U...UBiJ >x h +x l2 + ...+x ik , 

(4.8) £(X) = x x +x 2 + ... + x m . 

Then there exist measures Vx,V2, ■ v m (positive, not necessarily normed) such that 
for all i = 1, m, 

(4.9) MB?) = 0, 

(4.10) Vi (X) = x 4 , 

(4.11) Z = Vi+Vt + ... + v m . 



The proof of Lemma[6]is essentially combinatorial in character and is not related 
to the rest of the paper, so we postpone its proof to the Appendix. 

Lemma 7. Suppose fi n , fi satisfy conditions (i), (ii) of Theorem^ and choose any 
e > 0. Then there is an integer no such that for all n > no, 

A(/i n ,//) < 2e. 

Proof. We can find a finite cover Ax, A2, A m of X of measurable, nonempty, 
pairwise disjoint sets with diameter less or equal than e because of compactness 
of X. We apply Lemma [5] and find < S < e and an integer no, such that if 
Bx, B2, B m are the <5-neighbor hoods of Ax, A 2 , A m , and n > no, then (|4.ip 
holds. Choose n > no, and set Xi = fi(Ai), £ = fi n . The relation (|4.1|1 and the 
fact that (Ai) are pairwise disjoint imply (|4.7I) . As (A4) is a measurable partition 
of X, n{A x ) + ... + n(A m ) = n(X) = 1 = «„(A), which is by definition fgT5|) , Now 
applying Lemma [6] we obtain positive measures Vx, v%, v m such that 

fin = l>x+V2 + ■■■ + V m , 

<Bt) = 0, 

v%[X) — Vi(Bi) — fi(Ai). 

Let Ix = [a , ax), h = [ax, a-i), ■■■ Jm = [a m ~i,a m ] be a partition of [0, 1], = ao < 
ax < 12 < ... < a m -x < cim = 1) such that a, — aj-i = fi(Ai) for all i = 1, m (if 
fli—i = ai, then = 0). Then for all i = 1, m, 

Vi{Bi) = fi(Ai) = X(Ii). 

Now, using Theorem [TJ we construct a function / : I — > X such that f{h) Q Ai, 
i = l,...,m, and such that fi — /jA. The construction relies on the fact that 
{Ai,i — 1, ...,m} is a measurable partition of X, and fi(Ai) = 

Similarly, we can construct a function f n :I—*X such that f n {Ii) Q Bi, and 
such that /njA = fi n . We do it in the following way: let = A | Ii. We construct 
f n \ii to be any measurable function so that 

(fn\ii)nK — Vi. 

The construction implies that fit) £ Ai if and only if f n (t) £ Bi. Since Ai C Bi, 
and the diameter of Bi is at most 2e, then for any x £ Ai, y £ Bi we get y) < 2s. 
We conclude that D(f, /„) < 2e, and by definition A(fi,fi n ) < 2e. □ 

Lemma 8. If a sequence of measures fi n d-converges to a measure fi, then it sat- 
isfies (ii) from Theorem^ 
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Proof. Assume that A is /^-separating, and let D be an open set, such that Cl(A) C 
D, and such that /i(-D \ A) = 0. Let e > such that 2e-neighborhood of A is a 
subset of D. Let -B be the e-neighborhood of A, and choose an arbitrary open set 
C such that Cl(A) CCCB. Then the construction implies that 

(4.12) x G C, y G D c d(af, y) > e. 

Choose 5 < e, small enough such that the (5 neighborhood A is a subset of C. 
Now find no large enough such that for all n > no, A(/z, fi n ) < 5/2. Now we can 
find functions /, f n such that 

(4.13) D(f,f n )<5, 

and such that /i = /jA, //„ = / n jA. Definitions of A and £) imply that (i(D\A) = 0. 
Now Lemma [5J (i) implies that without loss of generality we can assume that for 
all 

(4.14) xel, f(x) <£D\A. 

Suppose f(x) G A. Then ()4.13p implies that d(f(x), f n (x)) < 5, so f n (%) is in 
8 neighborhood of A, which is a subset of C. Now, suppose f n (x) £ C. Now, 
because of ([4TT2jt and (|4~13| . /(i) ^ D c , and because of (|4~T4|) . /(x) ^ D \ A, so 
it must be f(x) G A. We deduce that / n (x) G C if and only if /(x) G A, hence 
^ n (C) = A(/- 1 (^)) = A(/- 1 (^)) = P(A). Since A is ^-separating, = 
so we deduce that for all n > n , /i„(C) = /i(C). □ 

We now prove Theorem [2] 

Proof. One implication of Theorem [2] follows from Lemma[71 the other from Propo- 
sition [2] and Lemma [8] □ 

Theorem [5] could be further modified: to check whether a w-convergent sequence 
is d-convergent, it is sufficient to prove (ii) only for closed ^-separating sets. 

The following important Corollary shows that the dynamical topology is in some 
sense sufficiently close and similar to the weak* topology, and as such is expected 
to have various applications. 

Corollary 3. Assume that X is connected. If supp fj, = X, then fj, n d-converges to 
fi if and only if it w- converges to \x. 

Proof. Indeed, if X is connected and supp /i = X, then there are no /^-separating 
sets. □ 



Now we give yet another characterization of d-convergence. By definition, a 
sequence /t„ G P(X) d-converges to a measure /t G P{X), if there exists a sequence 
of measurable functions f n ,g n : I —* X, such that /z„ = (/n)tjA, /i = (<7n)jA, and 
D(f n ,g n ) — > 0. Now we show that g n can be independent of n, and put it in the 
context of the well-known Skorokhod Theorem (see e.g. [5]). 

Theorem 3. (Skorokhod) Assume that X is metrizable, separable, and complete. 
A sequence of measures \x n G P{X) is w-convergent, if and only if there exists a 
sequence of measurable functions f n : I — ► X, and a function f : I —> X, such that 
for all a G / , f n {a) — > f(a), and such that fx n = /njA, and fi = /jA. 
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Corollary 4. Assume that X is metrizable. A sequence of measures fi n G P(X) 
is d-convergent, if and only if there exists a sequence of measurable functions f n '■ 
I — > X , and a function f : I — ► X , such that f n —*f as n — > oo, uniformly on I , 
and such that u n = / n jA, and /i = /jA. 

Proof. <J=: It follows directly from the definition of A. 

=>: Assume that u n d-converges to u. We can adjust the construction of the 
function / : I — > X in the proof of the Theorem [2] so that [i — /jA, and so that 
/ is independent of e. We do it by choosing e — 1/2™, and constructing the cover 
A\ : ...,A m for a chosen e to be a refinement of the cover A[, ...,A' m , for another 
e' > e. We can also, using the same construction, find a sequence /„ : / — > X so 
that D(f n , /) — > 0, and that /i„ = /njA, which proves the claim. □ 

5. Stability of invariant measures 

We now show that various notions of stability, including Lyapunov stability, 
attracting sets, asymptotic stability, and Nekhoroshev stability, generalize well to 
invariant measures. As in the previous sections, A is a compact metric space. In 
this section, / is a continuous function on A, and /j is then a d-continuous function 
on P(X), fifJ-(A) = /i(/ _1 (.A)) for all Borel measurable A. 

Definition 4. For all Borel measurable A C X, we call the set A$ the lift of the 

set A, defined as the set of all measures \x G P(X), such that supp(ji) C A. 

Lemma 9. Let A, B, Ai, i = 1, oo, be measurable subsets of X . Then (i) A = B 
if and only if A$ = B$; (ii) AC B if and only if A$ C B$; (Hi) (AnB)$ = A$nB$; 
H (HZiAih = nZMih; (v) (A^ C {Atf; (vi) A„UB,C(AUB) B . 

All properties (i)-(vi) follow directly from the definition of the lift. Note that 
in general in (v) and (vi) equality does not hold, so lift jj is not a morphism of set 
algebras. 

Lemma 10. If f : X — > A is continuous, then f#(A$) = f(A)$. 

Proof. C: Let /i G f$(A$). Then for some v G -P(A), /i = /ji/, and supp(^) C A. 
Since / is continuous and X compact, /(supp (y)) =supp(/jjt') =supp(/i), hence 
supp(^) C f{A). 

D: Let a G f(A) h i.e. supp( M ) C f(A). Let M " = E^x be any 

sequence of finitely supported measures which w-converges to jj,, and such that for 
all n, k, y% Gsupp(^); let a:JJ G A be any sequence such that f(x%) = yji, and let 
v be the limit point of a w-convergent subsequence of v n = EfcTi A}J<5(a;™)- Then 
supp(f) C / _1 (supp(//)) C A, and because of continuity of /, a = f(v), hence 

In the following, e-neighborhoods of sets of measures and other properties in 
P(A) are always with respect to the dynamical topology, unless specified otherwise. 
The key property of the dynamical topology is the following lemma: 

Lemma 11. Suppose AC. X is a closed set, and let U C A be an open set. Then 
U is the e -neighborhood of A if and only if '[/jj is the e -neighborhood of A$. 

Proof. =>: Denote by V the ^-neighborhood of A§ in P(X). 

(i) Claim: U$ C V. Choose any u G £7jj, and then by definition supp(/i) C [/. 
Since A is compact and supp(^i) closed, there exists S > such that supp(^) is a 
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subset of a (e — 2<5)-neighborhood of A. Now, Proposition [5] implies that there is a 
measure vg supported on a finite set {xi, Xk} such that A(/i, us) < 6. Without 
loss of generality we also assume that v&{{xi}) > for all i, and then it is easy to 
see that for all 1 < i < k, Xi is in the (e — <5)-neighborhood of A. We now choose 
2/i, ■■■,yk S A, such that d(xi,yi) < e — 5, and define v = Yli=i l/ s({xi})Sy i . Now, 
v £ j4(j, and because of the choice of Xi and y% it is A(ys,v) < s — 6, hence by 
triangle inequality A(/i, v) < e. 

(ii) Claim: V C£/j. Choose any fi £ V, and by definition of V we can find v £ Aj 
such that A(/x, v) < e. We choose 8 > such that A(^i, ^) < e — 2<5. We find /, g : 
J — > X, such that = /jA, ^ = gjA, and such that D(f, g) < A(/^, v) + 5 < e — 8. 
Applying Lemma [3l we find /, <?, as in the Lemma, and then D(f,7j) < e — 6. For 
any x Gsupp(/i), we can find t E I such that d(x, f(t)) < 5, and since by definition 
d(f(t),g(t)) < D(f,g) < e — 5, we get that d(x,g(t)) < e. By construction of g, 
g(t) Gsupp(V) C A, therefore x G U. 

It now follows from uniqueness of e-neighborhood. □ 

Lemma 111 I is not true for weak* topology, or uniform topology. 

Corollary 5. A set A C X is open (respectively closed), if and only if A^ is open 
(respectively closed). 

Proof. Lemma [Til implies that A is open if and only if A$ is open. 

Assume that A is closed, and choose any convergent sequence of measures fi n £ 
Aj, converging to /i. Proposition[6j (ii) now implies that supp(/x) C A, hence fi £ Aj 
and Ajj is closed. Now, assume that Aj is closed, and let x n be any convergent 
sequence in A, converging to x. Now by definition of d-convergence, 5 Xn converges 
to S x , and since Aj is closed, 5 X £ Aj. By definition a; £ A, so A is closed. □ 

Corollary 6. Let f : X —> X be continuous. Then a set A C X is closed and 
f -invariant if and only if Aj zs closed and f^-invariant. 

Definition 5. Lyapunov stability. Given a continuous function f on X , we 
say that a closed invariant set A of the dynamical system f is Lyapunov stable, if 
for each e > 0, there exists S > 0, such that if U, V are e, <5 neighborhoods of A 
respectively, then for all n > 0, n £ N, f n (V) C U . 

Proposition 7. Suppose f is a continuous function on X. A closed invariant set 
A is Lyapunov stable with respect to f , if and only if Aj is Lyapunov stable with 
respect to /j. 

Proof. Suppose A is Lyapunov stable, i.e. for a given e > 0, f n (V) C U for 
some 8 > and all n > 0, with U, V respectively e, S neighborhoods of A. By 
definition, f n (V) C {/ is equivalent to / n (V))j C f/j, and because of Lemma ITOl and 
f n {V)i = ffm), this is equivalent to f£(V t ) C C/j. Lemma HH states that [/, V 
are respectively e, S neighborhoods of A if and only if C/j, Vjj are respectively e, 5 
neighborhoods of Aj, which completes the proof. □ 

Definition 6. Asymptotic stability. Given a continuous function f on X , we 
say that a closed invariant set A of a dynamical system f is asymptotically stable, 
if there exists e > 0, such that for each x £ U, where U is the e-neighborhood of A, 
]im n ^ oo d(f n (x),A)=0. 
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Proposition 8. Suppose f is a continuous function on X . A closed invariant set 
A is asymptotically stable with respect to f, if and only if Aj is asymptotically stable 
with respect to fp 

Proof. =>: Suppose A is asymptotically stable. Let e > 0, U, be as in the definition 
of the asymptotic stability, and choose any S > 0. Now, because of compactness of 
X, there is no such that for all n > no, f n (U) C V, where V is the ^-neighborhood 
of A. This, and Lemma [TT1 imply that for n > uq, and any measure fi G Up f$(p) 
is in the <5- neighborhood of Aj . Since 6 was arbitrary, Aj is indeed asymptotically 
stable. 

<=: Assume that Aj is asymptotically stable, and choose e > 0, C/j, as in the defi- 
nition of asymptotic stability, where because of Lemma [TT1 U is the e-neighborhood 
of A. By definition, if x G U, then S x G {/j, and then A(f^(S x ), Ap) — » as n — > oo. 
That, and the relation /^(Sx) — Sfn^ now imply that d(f n (x), A) — > as n — > oo, 
which proves that A is asymptotically stable. □ 

Somewhat stronger property than asymptotic stability is that of an attractor. 

Definition 7. Attractor. Given a continuous function f on X , a closed invariant 
set A of the dynamical system f is an attractor, if there exist e > 0, and N > 0, 
such that ifU is e -neighborhood of A, then f N (U) C U, and A = r\^ =1 f n (U). 

(There are various definitions of attractor in the literature; we have chosen the 
definition from [5].) 

Proposition 9. Suppose f is a continuous function on X . A closed invariant set 
A C X is an attractor with respect to f, if and only if 'Aj is an attractor with respect 
to fp 

Proof. The Proposition follows from the following equivalences: (I) Lemma [TT] im- 
plies that t/j is the e-neighborhood of Ap if and only if U is the e-neighborhood 
of A; (II) Lemma IH1 (ii) and Lemma [Tul imply that f N (U) C U if and only if 
ff(Up) Q Up and (III) Lemma H (iv) and Lemma [TO] imply that nf =1 ff{Up) = 
(n%LJ n (U))p That, and LemmaEJ (i), now imply that A = n£° =1 /"(f7) if and 
only if Aj = n^ =1 f^(Up). ' □ 

An analogous claim holds for repellers, sinks and for sources. 

Definition 8. Exponential stability. Given a continuous function f on X , a 
closed invariant set A of the dynamical system f is exponentially stable, if there 
exist constants C , A > 0, and e > 0, such that for each < 5 < e, if U is a closed 
S neighborhood of A, then, d H {A,f n (U)) < Ce- Xn d H (A,U). 

We now introduce notation related to the Hausdorff metric induced by A on 
closed subsets of P(X). Given /x s P(X), we set A(/i,A) = inf{A(|U, i/), v G A}, 
A(A, B) = sup{A(^, B)\n& A}, and then A H (A, B) = max{A(A, B), A(B, A)} is 
the Hausdorff metric. 

Lemma 12. Let U, V C X be closed sets. Then d H (U, V) = A H (Up Vp). 

Proof. Lemma[3] implies that A(/x, v) < sup^g^pp^) d(x,supp(u)), hence A(/i, Vj) < 
sup^gsupp^) d(x, V). We now deduce that for ±i G Up A(/i, VJ) < sup^g^ d(x, V) = 
d(U, V), and by taking supremum over \i G Up we get A(Up Vp) < d(U, V). 
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The definition of A and Lemma [3] easily imply that A(6 X ,V$) = d(x,V), hence 
M u h v t) > snp xeU A(S x ,V i ) = sup xeU A(x, V) = d(U,V). We conclude that 
A(t/|j, Vjj) = d(U, V), and similarly A(Vj, U$) = d(V, U), which completes the proof. 

□ 

Proposition 10. Suppose f is a continuous function on X . A closed invariant set 
A is exponentially stable with respect to f, if and only if Aj is exponentially stable 
with respect to fa. 

Proof. It follows directly from the definition of exponential stability, Lemma [Til and 
Lemma [TJ] □ 

A similar claim can be also proven for Nekhoroshev stability (see e.g. [TO]). 

We now give an example which shows that the claims above do not hold for 
uniform or weak* topology. In other words, we show that uniform or weak* topology 
on P(X) are not the right topologies for generalizing notions of stability to spaces 
of measures. 

Example 2. Assume that f is a dynamical system with one attracting sink x, and 
one source y. Such a dynamical system can be constructed on a, say, 2-sphere, with 
the sink and the source being the poles. We denote by S x and S y the probability 
measures concentrated on x, y respectively, and we define fi s = (1 — e)S x + eS y , for 
a given e > . Now, for small e > 0, /i £ is arbitrarily u-close and w-close to 8 X 
(but not d-close!). However, [i e is a fa-fixed point. We conclude that {S x } = 
is neither an attractor, nor asymptotically stable set for fa in weak* or uniform 
topology on P{X). 

Now, we naturally generalize the notions of stability to /-invariant measures, or 
more generally to /j-invariant sets of measures. (Note that if is an /-invariant 
measure, then [i is /jj-fixed point, and {//} is a /jj-invariant closed set.) 

Definition 9. Suppose f is a continuous function on X. 

We say that a f -invariant measure [i is Lyapunov stable, asymptotically sta- 
ble, an attractor, or exponentially stable, if {[i} is Lyapunov stable, asymptotically 
stable, an attractor, or exponentially stable respectively, with respect to fa and the 
dynamical topology on P{X). 

More generally, if A is a fa-invariant, d-closed set of measures, we say that it is 
Lyapunov stable, asymptotically stable, an attractor, or exponentially stable, if it is 
so with respect to fa and the dynamical topology on P(X). 

An information on stability of invariant measures gives much more information 
on dynamics in a neighborhood of a set (which can be the support of an invariant 
measure) , then the information on stability of invariant sets. The following example 
illustrates that claim. 

Example 3. Let X = M? /I? be a 2-torus, and we define a function f(x,y) = 
(x' , y') with 

x' = x + y, 

y' = v, 

(a standard map with k = 0). Let A be the circle y — 0. Then A is a closed 
invariant set (all points on A are fixed), and also Lyapunov stable. Let A be the 
Lebesgue measure on A, and v any probability measure supported on A, different 
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from X. Both X, v are j ' -invariant, and supported on a Lyapunov stable set A. Now, 
one can check that X is Lyapunov stable, and v is not. This reflects the fact that 
the rotation in the vicinity of A is "with uniform speed". Similar examples could be 
constructed for attractors and exponentially stable sets. 

6. The optimal transport problem 

In this section we define the oo-optimal transport problem, and show that the 
metric on the set of measures induced by the oo-optimal transport problem is 
equal to the dynamical A metric. We then deduce that the oo-optimal transport 
problem generates a different structure on the set of of measures than the p-optimal 
transport problem for any 1 < p < oo. 

Definition 10. Let X be a metric space with a metric d, and ji, v be two Borel 
probability measures on X . 

The set of all transports T(/i, v) of measures fj,, v is the set of all Borel probability 
measures 7 on X x X, such that iti^"/ — [i, 7T2|)7 = v, where tt\, 7T2 are projections 
of X x X to the first, resp. second variable. 

Distance of measures [i, v with respect to a transport 7 G T(/x, v) is defined with 

A 7 (/i, v) = inf {sup{d(a:, y)\(x,y) G A} | A C X x X measurable, and j(A) = 1}. 

The 00 -Wasserstein distance of two measures fi, v is defined with 

(6.1) A^/i, v) = inf{A 7 (/x, v) | 7 G T( M , v)}. 

If the minimum in h6.1\) is attained, then any measure 7 for which A 7 (/i, v) = 
Aoo(/x, v) is called a solution of the optimal transport problem with respect to the 
measures /i, v. 

Now we prove that the 00- Wasserstein distance is the same as the dynamical 
metric A. 

Proposition 11. Given two probability measures n,v, then Aoo(/i, v) = A(/i, v). 

Proof. Claim: A OQ (/i, v) < A(/i, v). Choose any measurable f,g:I—>X, f$X = /i, 
gjA = v. Then we define 7 := (/ x g)$X, where (/ x g)(x) :— (f(x),g(x)). The 
assumptions imply that 7 G T(/i, v), and then 

AooO,^) < A 7 (/x,z>) < SMv{d{x,y)\(x,y) G (/ X g)(I)} = D(f, g). 

Now, it is sufficient to take infimum of the right hand side of the equation above 
over all /, g, such that /jA = /i, gjA = v. 

Claim: A([i,v) < Aoo(/i, v). Suppose 7 G T(fi,u). By applying Theorem[TJ we 
can find a function h : I —> X x X, such that 7 = /ijA. We define / = %± o h, 
g = 7T2 o h, and then [i = /jA, v = g^X. Let A C X x X be any set such that 
j(A) = 1, and let J = h~ 1 (A). Now definitions and Lemma [2] imply that 

A(fM, v) < Dj(f, g) = sup{d(a;, y) \ (x, y) G h(J)} 

= sup{d(x,y) I (x,y) G A}. 

The proof is completed by taking the infimum of the right hand side of the 
equation above over all A, 7. □ 

Corollary 7. Assume that X has at least two elements. Then oo-Wasserstein 
metric Aqq — A is neither uniformly nor topologically equivalent to any of the 
p- Wasserstein metrices W p for 1 < p < 00. 
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Proof. For any p, 1 < p < oo, the metric W p is uniformly equivalent to the 
Prokhorov metric on P(X) (see e.g. [6]), hence topologically equivalent to the 
weak* topology. Corollary^ (i) implies the claim. □ 

We close this Section with a proof that the co-optimal transport problem has a 
solution. 

Proposition 12. There exists a measure 70 G T(p,v), for which the minimum of 
the expression H6.1\) is attained. 

Proof. For given p, v 6 P(X), we first prove that the map 7 1 — ► A 7 (/z, v) is w- 
lower semi continuous. Since d : X x X — > M is continuous, {sup{d(x,y) G 
A} = {sup{d(x, y) \{x,y) G CI (A)}, and 

(6.2) A 7 (/i, 1/) = {sup{d(x, y) | (x, y) G supp 7}. 

Choose any sequence of measures j n — > 70, convergent in w-topology, as n — > 00. 
Now for any e > 0, and any (a;, y) G supp 70, let B e be an open e-ball around (x, y), 
and then by definition of the support jq(B s ) > 0. Since liminf "f n (B £ ) > jq(B £ ) > 
0, (|6.2p implies that there is no such that for all n > no, j n (B e ) > 0. For each n > 
no, there is a point (x', ?/) G supp 7„ fl B e , and then because of (|6.2j) . A 7n (u, i/) > 
cz^x', y') > d(x, y) — 2e. Since e was arbitrary, we conclude that lim inf A 7n («, v) > 
d(x,y). Since (x,y) G supp 70 was arbitrary, we get liminf A 7n (u, z/) > A 7o (/i, z/). 

Suppose now 7„ G v) is a sequence of transports such that A 7ri (u, v) < 
Aoo(", i 7 ) + Now 7„ has a w-convergent subseqence, which converges to an- 
other transport 70. By definition A 7o (/z, v) > Aoo(/i, v), and because of w-lower 
semicontinuity, A 7o (/z, 1/) < A DO (/i, 1/) , so 70 is the required measure. □ 

Corollary 8. For given probability measures p, v , there exist measurable functions 
f, g : I -> X, f t X = p, g t X = v, such that A(p, v) = D(f, g). 

Proof. If 70 G T(/i, v) is the measure for which (|6.1|) is minimal, and h : I —* X XX, 
h a measurable function such that 70 = /ij A, then / = 7Ti o h, g — it2 ° h are required 
functions. □ 



7. Appendix: The proof of Lemma [B] 

In this Appendix we prove the fact from the proof of Theorem [2J which is es- 
sentially combinatorial in character. Assume that an integer m, any family of 
measurable subsets (Bi,B 2 , ...,B m ) of X, and a measure £ are given. We first in- 
troduce some notation and definitions. Let V({1, m}) be the set of all subsets 
of {1, m}, and for a nonempty (p G "P({1, m}), 

b v ■= B n n b 12 n ... n B lk n nfl?n...n , 

where 99 = {ii,i2, •■■ ) ife}, f C = {ji , 32, •■■jjm-fc}- Let |<^| denote the cardinal num- 
ber of ip. We define the arrangement of (_Bi, S 2 , ...,B m ) to be p(B 1 ,B 2 , ...,B m ) = 

EfcLiPfc' where 

Pfc = |{^> C {1, m} : |p| = k, and > 0}| 

(The arrangement p depends also on the measure £, which we omit from the ar- 
gument of p because it is always clear which measure is being considered). The 
k th arrangement pk shows how many sets of intersections of exactly k sets Bi have 
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positive measure £. The arrangement p is the number of such sets B v for any k > 0, 
such that they have positive measure £, and then 



Now we prove Lemma [51 

Proof. Let £ be a measure on X (positive, not necessarily a normed one) , Xi , x% , x f 
nonnegative real numbers, and Bi,B2, ...,B m measurable subsets of X, such that 

flUD , SS> hold. 

We prove the claim inductively, with respect to two integers m, p, where m is 
the number of sets Bj, i = 1, to, and p their arrangement p = p(B\, B2, B m ). 

The basis of the induction is the case m = 1, p = 1. Then z/i := £ clearly satisfies 
gH), (jLTO) . (HH]). 

Now, for a given m, p, assume that the claim is true for any family of measurable 
sets (-Bi)<=i,...,TO', where m' < m, p' — p(B[, ....,B' m ,) < p, and either to' < to or 
p' < p. Choose a family of measurable sets Bi, B2, B m , with the arrangement p, 
and assume that Xi,X2, ■■■,x- m are nonnegative real numbers satisfying (|4.7|) , (|4.8|) . 
We analyze three cases: 

Case 1: There exist t = i2, ifc}, 1 < A; < m — 1, sucft i/iai i/iere is equality 
in |^.7[ ) ; namely that 



where £ | ^4 is the measure £ | j4(Y) = £(^4 n Y). It is easy to check that (Bj), 
(x'j), and also (B'J), (x"), £" satisfy (|4~7|) . and (|4~5|) . By applying the inductive 
assumption, we can find v[, i = 1, and f", i = 1, ...,m — fc, satisfying (|4.9[) . 
(l4~TD|l . 631]). We define 



and then ....,f m satisfy (1431) , (HUHl), (|4~TT|) . 

Case 2: There exists k, 1 < fc < m, such that Xk = 0. 

We define Uk — 0. We set Bj = Bj, x'j — Xj, for j = 1, k — 1, and Bj = B? + i, 
xj = for j = k + l,...,m. Then (Bj), (xj), £ satisfy (|477|) . and ijO]) . and 

by applying the inductive assumption we find i/, i = 1, ...,m — 1, satisfying (|4.9p . 
(|4.10jl . (|4.1ip . Then we set v 3 = z/, for j = l,...,fc - 1, and v 3 = i/j_ l5 for 
j = fc + 1, to, and so prove the claim. 

Case 5: For a/Z 1 < k < to — 1, and aH subsets i%, ifc} C {1, to}, 



(7.1) 



p < 2 m - 1. 



£(B 81 UB ij U...UB i J=3; il +i ij +... + x ik . 



Let {jfc+i,ife+2, ...,i"m} = 0i>^2, ■■■,«fc} C - We define 





£(Bi! U B i2 U ... U B ifc ) > x ix + x l2 + ... + x. 
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and also for all 1 < k < m, Xk > 0. 

We rewrite the relations (14. 7p . using the introduced notation: for all 1 < ii < 
i2 < ... < ifc < m, 1 < k < m — 1, 

(7.2) £(B il UBi 2 U...UB ih )= £(B V ) > x h + x i2 + ... + x ih , 

ipn{i! ,12,— ,ifc}7 s 

where ip is any nonempty C {1, m}. Let 

5{i u i 2 ,...,i k } = £,{B ll UB, 2 U ...UB»J - x n - a; 42 - ... - 
We now define 

(7.3) 6 = min{<5 v , 1 < \<p\ < m — 1}, 

and let l = {ji, ...,jk}, 1 < k < m — 1, be the set for which (|7.3[) is minimal. The 
assumption of the Case 3 implies that S > 0. Choose any i/> such that ^(B^) > 0. 
(Note that the assumptions of the Case 3 do not apply that for all nonempty ip, 
£,{B V ) > 0.) Let p be any p £ ip. 

For some e (to be chosen later), we define: 

B'i = B t , 

Xi — e if i = p, 
x l if i 7^ p, 

and then (|7.2[) implies that 

^r'mr'm llRM J((Bi,uB jj u...uB ik )- E if (i 1 ,...,4)nv^0, 
^ 11 12 £(A 1 u5 i2 u...uB i J if (i 1 ,...,4)nv = 0. 

Let £o be the maximal e, such that for all nonempty i 2 , ik} Q {1, ...,m}, 
1 < fc < m — 1, 

(7.4) U B' i2 U ... UB!)> < + < + ... + < . 

Such e > exists because of 6 > 0, and for e = e , there is equality in (|T.4|) for 
some i2, ife}, 1 < fc < m — 1. We now fix e to be 

e = min{e , x p , £(-&,/>)}■ 
The relation (f7T4|) and the fact that e < e imply that (B'j), (x'j), £' satisfy ([4~7| : 
s < x p implies that (x£), i = 1, ...,m are nonnegative; and e < £(£>i/>) implies that 
£' is well defined (i.e. a nonnegative) measure. 

By exchanging (Bj), (xj), £ with (-Bj), ( x j), C in f|4.8|) we subtract e from both 
sides, so (|4.8[) still holds for (-Bj), (a;!-), We claim now that by applying the 
assumption of the induction, we can construct v[, v' 2 , v' m , satisfying (14.91) . (j4. 1 0[) . 
(|4.11|) . We again discuss three cases: 

Case 3.1. e = Eq. Then for some {ii,«2, ••• J *fe} ) 1 < k < m — 1, there is equality 
in \4- 7\ ), so the problem reduces to the problem analyzed in the Case 1. 

Case 3.2. e = x p . Then x' p = 0, so the problem reduces to the problem analyzed 
in the Case 2. 

Case 3.3. e = ^{B 4> ). In this case, = 0, so p(B[, B 2 , B' m ) = p(B u B 2 , B m )- 

1, hence we can construct v[, v 2 , v' m inductively. 



STABILITY OF INVARIANT MEASURES 
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Now we define 

v[ if i ^ p. 

Then the construction implies that i/j, i = 1, m satisfy (|4.9p . (14. 10[) . (|4.1ip . □ 
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